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METHODS FOR SEPARATING AND IDENTIFYING DNA MOLECULES 

The invention relates to methods for separating and 
identifying DNA molecules in mixtures of DNA molecules 
5 having the same number of nucleotides but different base 
sequences . 

1 . Background to the Invention 

10 1 . 1 General Introduction 

Genes exist as multiple alleles which differ from each 
other by small differences in sequence. Individuals are 
often heterozygous with respect to the alleles of 
15 particular genes; i.e. individuals often have two different 
alleles of the same gene. 

In some circumstances, it is desirable to separate the 
alleles of a gene from a mixture of the alleles. For 

2 0 example, when it is desired to carry out a test to 

determine which alleles of a gene are carried by a 
heterozygous individual, it is often necessary to separate 
the two alleles before carrying out the test because the 
presence of two alleles in one test can prevent meaningful 
25 results from being obtained. 

In view of the fact that the difference between the alleles 
of a gene can be as little as one nucleotide, it is often 
difficult to separate the alleles from a mixture of the 

3 0 alleles. These difficulties are increased in genes which 

have a very large number of different alleles, such as the 
major histocompatibility complex (MHC) genes (e.g. the 
human leucocyte antigen (HLA) class I genes which have 222 
known alleles) . 

35 

HLA matching between a bone marrow or kidney recipient and 
donor is one of the major factors influencing transplant 
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success . Up to date the most favourable bone marrow 
transplant (BMT) and kidney transplant results have been 
obtained using sibling donors who are genotypically HLA- 
identical to the recipient but such donors are available 
5 for only about 30% of patients (1 " 5) . BMT using unrelated 

donors can be successful, but these transplants have higher 
rates of graft failure, increased incidence and severity of 
Graft versus Host Disease and more frequent complications 
related to delayed or inadequate immune reconstitution <4) . 

10 

New molecular biological methods for detection of genetic 
polymorphism currently provide an opportunity to improve 
e.g. HliA matching of unrelated donors as well as a research 
tool to investigate the relationship between disparity and 

15 transplant complications. These molecular typing methods 

include sequence- specif ic amplification, hybridisation with 
oligonucleotide probes, heteroduplex analysis, single 
strand conformation polymorphism and direct nucleotide 
sequencing . 

20 - - 

Each of these molecular approaches has been used for 
routine HLA class II typing {6) , but a variety of reasons 
related to the HLiA class I gene structure has complicated 
their application to class I typing. The reasons for these 

25 limitations are the extensive polymorphism of each class I 
locus and the degree of sequence homology between the loci . 
In addition, sequence homology between class I classical 
and non-classical genes and the reported 12 pseudo genes 
can cause problems for specific locus amplif ication (7) . 

30 

The low extent of "allele specific" sequences at 
polymorphic sites is a feature of the HLA class I genes 
that has limited the resolution of all current DNA typing 
approaches. An "allele specific" sequence is a sequence 
35 that is only present in one allele and can therefore be 
used to distinguish the allele from other alleles. 



10 



20 



30 



The main problem which complicates the identification of an 
allele is the presence of a mixture of alleles, as well as 
contamination by segments of DNA which have homology to the 
allele it is wished to identify and which are co-amplified 
in PCR. Current typing methods are sometimes unable to 
resolve the allele it is wished to identify from 
contaminating DNA fragments. Separation techniques such as 
single strand conformation polymorphism (SSCP) can only 
partially resolve this problem. 

^ Sequence specific primer amp lification (pcR-g.gp) 



This method utilises both the group-specific and, when 
present, allele-specif ic sequence sites in PCR primer 
15 design. Although each SSP may not individually define the 
required specificity, the use of combinations of sequence- 
specific primers allows the amplification of their common 
sequences to give the desired HLA specificity. The SSP- 
design is based on the amplification refractory mutation 
system (ARMS), in which a mismatch at the 3' residue of the 
primer inhibits non-specific amplification' 



,(8,9) 



However, despite its high accuracy, PCR-SSP is only in some 
cases more informative than serology. The reason for this 
25 is the low extent of allele specific polymorphic sites in 
the exon sequences and this limitation has produced a vast 
amount of research into the identification of allele 
specific motifs localised in the intron sequences' 101 . 
However, up to date this information has not contributed 
considerably to the identification of more alleles. 



Another limitation of this method is that it detects a 
limited number of polymorphic sequences which are utilised 
to predict the entire sequence. If an unknown allele is 
35 present in a particular sample this extrapolation may be 

incorrect . 
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1 3 g-ir, q l^ B tra^ informa tion po lymorphism (SSCP j 
T^Is technique is based on the electrophoretic mobility of 
single stranded nucleic acids in a non-denaturing 
polyacrylamide gel, which depends mainly on sequence- 
related conf ormation'"""' . The technique can be employed 
for isolating single alleles which could then be used for 
further manipulation and analysis such as direct 
sequencing. The pattern of bands obtained after 
electrophoresis may be diagnostic for an allele. Although 
, the technique is easy to perform, the specificity and 

sensitivity are poor and its use has been limited to only 
few HLA class I subtypes ( 14 • 1S 1 . 

The successful use of the technique relies on group 
5 specific amplification and therefore prior knowledge of 
broad HLA specificity is needed. 

1.4 Heteroduplex an alysis 

0 Fully matched DNA duplexes are more stable than those with 
base mismatches. Instability of the duplex increases with 
the number of nucleotide mismatches which produce an 
increasing "drag effect" in polyacrylamide gels which 
retard the migrating bands 116 " 18 ' 



This technique exploits the formation of mismatched DNA 
hybrids (heteroduplex) which are formed at the end of each 
PGR cycle between coamplified alleles from a particular 
locus or loci. During the annealing stage of each cycle of 
30 the PCR, a proportion of sense strands of each allele may 
anneal to anti-sense strands of different alleles. A band 
pattern is obtained in PAGE analysis which can be useful^ 
for identifying the alleles involved in the reaction • . 

35 Heteroduplex analysis is an approach that has been utilised 
to compare HLA genes of a particular donor and recipient. 
HLA genes are amplified, denatured (melted into single 
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strands) and mixed together under conditions that promote 
renaturation to form double stranded molecules. If the HLA 
genes of a donor and recipient are similar but not 
identical, heteroduplexes will form consisting of one 
5 strand of an allele of donor origin and a second strand 
from a different allele of recipient origin (22 ' 23> . 
Heteroduplex molecules can be detected by altered 
electrophoretic migration. The sensitivity of this method 
can be increased by adding DNA from an HLA allele that is 
10 not present in the donor or recipient. 

The major advantage of heteroduplex analysis is that it is 
relatively easy and inexpensive. Limitations of this 
approach include inability to detect certain HLA 
15 disparities, potential detection of irrelevant silent 

mutations and lack of specific information regarding the 
nature of the alleles involved. 

Up to date this approach has been used only for HLA class 
20 II typing with limited success. It has not been applied to 
HLA class I analysis. 

1 . 5 Sequence specific oligonucleotide probes (PCR-SSO) 

2 5 SSO typing involves amplification of HLA alleles from a 

particular locus followed by hybridisation with a panel of 
oligonucleotide probes to detect polymorphic sequences that 
distinguish one allele or group of alleles from all others. 
This is sometimes referred to as low resolution or generic 

30 oligotyping. Following low resolution typing, selected 

primers can be used to achieve amplification of individual 
alleles which are then identified by specific probes. This 
second stage of oligotyping is often referred to as high 
resolution oligotyping <6) . 

35 

The major methodological drawback of this approach is that 
the complexity of the technique is directly related to the 
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number of alleles under investigation and the presence of 
two alleles in the heterozygous condition can complicate 
the identification process. 

5 Based on PCR-SSO several typing approaches for HLA-A and B 
have been published and the number of probes that they 
require are over 40 and 90 probes respectively (24,25) . This 
leads to a highly complex operation which is time consuming 
and the resolution obtained is only moderate. 

10 

Published oligotyping methods could result in incorrect 
interpretation of data if certain combinations of recently 
discovered alleles are present in a specimen 1261 . This 
limitation and the great number of probes needed have led 
15 several groups to explore the feasibility of automated 
sequencing for routine HLA class I typing . 

The advantages of PCR-SSO methods are specificity, 
sensitivity, simplicity, reproducibility, inexpensive and 
20 that they allow simultaneous processing of many samples. 

1 . 6 Direct Nucleotide Sequencing 

This technique involves PCR amplification using one 
25 biotinylated and one non-biotinylated primer. Following 
amplification, PCR products are captured by streptavidin- 
coated magnetic beads and the non-biotinylated strand is 
removed, this permits unidirectional solid-phase sequencing 
of purified single -stranded DNA (6) . 

30 

Sequencing methods can be differentiated by whether the 
template has been cloned or produced directly from genomic 
DNA by an initial PCR amplification. Sequencing approaches 
can also employ either manual or automated techniques (14> . 
35 Cloned templates and templates produced from allele- 

specific PCR amplification represent a single sequence 
derived from one HLA haplotype. In contrast, alleles from 
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both HLA haplotype in heterozygous samples may be co- 
amplified and sequenced together using locus-specific PCR 
primer. The recent availability of software, which allows 
the user to align the derived sequence against established 
5 libraries, has facilitated the analysis and allele 
assignments for heterozygous samples in which both 
templates are sequenced at the same time <27> . The 
effectiveness of this method depends on the amount and 
frequency of ambiguous heterozygous combinations which for 
10 HLA class II are not many, but for class I the number of 
ambiguous combinations is so high that the obtained 
resolution is comparable with serology. 

Up to date two HLA class I typing approaches based on 
15 direct sequencing have been published, both require 
serology information followed by allele specific PCR 
amplification and then direct sequencing U4 ' 27) . Although 
this approach gives the highest resolution, it is expensive 
and difficult to perform routinely. 



20 



2 . Summary of the Invention 



The invention provides methods for separating and 
identifying a DNA molecule in a mixture of DNA molecules 
25 having the same number of neucleotides but different base 
sequences. The separation methods comprise 

(i) amplifying the DNA molecules in the mixture; 

3 0 (ii) hybridising single strands of the amplified DNA 

molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; 
and 

3 5 (iii) separating the duplexes. 



The different DNA molecules in the original mixture give 
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rise to duplexes having different numbers, positions or 
types of mismatches. This allows the duplexes to be 
separated by, for example, gel electrophoresis. The 
separated duplexes can then be analysed to identify the DNA 



embodiment of a method of the invention is illustrated in 
Figure 1 . 

The method of the invention can be used as a diagnostic 
10 technique by the use of a specific reference DNA molecule. 
The formation of a homoduplex identifies a DNA molecule in 
the unknown mixture as identical to the reference DNA 
molecule . 

15 The method can also be used as a separation technique for 

separating the alleles in a mixture of unknown alleles of a 
polyallelic gene, such as the mammalian MHC genes (e.g. the 
HLA genes) . Duplexes formed between the unknown alleles 
and a reference allele are separated so as to isolate the 

20 unknown alleles for identification by techniques such as 
DNA sequencing, SSP and SSO. 

In one embodiment, the invention provides a method which 
comprises 



5 



molecules that were present in the original mixture. One 



25 



(i) 



amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
amplified mixture of double -stranded DNA 
molecules in which one of the strands carries a 
ligand; 



30 



(ii) 



contacting the amplified mixture of double- 
stranded DNA molecules with a receptor on a solid 
support under conditions such that the ligand 
binds to the receptor; 



35 



(iii) separating the mixture of double -stranded DNA 
molecules into single- strands and removing the 
strands that are not bound to the support by 
the ligand ; 

5 

(iv) recovering the remaining strands from the 
support ; 

(v) mixing the recovered strands with a complementary 
10 strand of a reference DNA molecule so as to form 

duplexes; and 

(vi) separating the duplexes. 

15 

The complementary strand of the reference DNA molecule may 
be provided by essentially the same technique as the 
technique set out above in steps (i) to (iv) for providing 
the mixture of DNA molecules in single-standard form. In 
20 particular, the reference complementary strand DNA molecule 
may be provided by 

(i) amplifying the reference DNA molecule employing 
a pair of primers in which one of the primers 

2 5 carries a ligand, so as to produce amplified 

double -stranded reference DNA molecule in which 
one of the strands carries a ligand; 

(ii) contacting the double -stranded reference DNA 

3 0 molecule with a receptor on a solid support under 

conditions such that the ligand binds to the 
receptor; 

(iii) separating the double -stranded reference DNA 
3 5 molecule into single -strands and removing the 

strand that is not bound to the support by the 
ligand; and 
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<iv) recovering the remaining strand from the support. 



In another embodiment of the invention, there is provided a 
5 method which comprises 



(i) amplifying the DNA molecules in the mixture 

employing a pair of primers in which one of. the 
primers carries a high molecular weight molecule, 
10 so as to produce an amplified mixture of double - 

stranded DNA molecules in which one of the 
strands carries a high molecular weight molecule; 



(ii) separating the mixture of double -stranded DNA 
15 molecules into single strands; 

(iii) mixing the single strands with a complementary 
strand of a reference DNA molecule so as to form 
duplexes; and 

20 

(iv) separating the duplexes . 



The complementary strand of the reference DNA molecule may 
25 be provided by 

(i) amplifying the reference DNA molecule employing a 
pair of primers in which one of the primers 
carries a high molecular weight molecule, so as 
30 to produce an amplified double -stranded reference 

DNA molecule in which one of the strands carries 
a high molecular weight molecule; and 



(ii) separating the double -stranded reference DNA 
35 molecule into single strands. 
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This embodiment overcomes the need for solid support 
systems by conjugating one primer of a pair of primers 
directly to a high molecular weight molecule (e.g. a 
protein) , for the reference and test systems. The 
5 amplified product after hybridisation can be applied 

directly to a separating gel. The high molecular weight 
conjugates are retained in the gel compared to the duplex 
without attachment of the high molecular weight molecule. 

10 In a further embodiment of the invention, there is provided 
a method which comprises 

(i) amplifying a single strand of each of the DNA 
molecules in the mixture; 

15 

(ii) mixing the amplified single strands with a 
complementary strand of a reference DNA molecule 
so as to form duplexes; and 

2 0 (iii) separating the duplexes. 

In this embodiment, the complementary strand of the 
reference DNA molecule may be provided by amplifying a 
single strand of the reference DNA molecule. The 
25 amplification of the single strand of the reference or test 
DNA molecule can be done, for example, by asymmetric PCR. 

This embodiment overcomes the need for both solid support 
systems and conjugation of one primer of a pair to a high 

30 molecular weight molecule. However, in the embodiment it 
is possible to use a primer carrying a ligand such as a 
hapten in order to facilitate capture of the amplified 
strand with a receptor such as an antibody and separation 
of the amplified strand from other components in the 

35 amplification mixture. 



After separation of the DNA molecules by one of the above 
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methods, the molecules present in the mixture may be 
identified by carrying out one or more of the following 
steps : 

5 (i) comparing the positions of the separated duplexes 

on the gel with the position of a control DNA; 

(ii) sequencing each of the separated molecules ; 

10 (iii) sequence specific primer (SSP) amplification 

analysis; and 

(iv) sequence specific oligonucleotide (SSO) analysis. 

15 The invention provides an improvement over prior methods 
for separating DNA molecules. The advantages offered by 
the invention can be summarised as follows : 

(a) The invention provides a high resolution between 
2 0 different DNA molecules and differences of as 

little as one nucleotide between molecules can be 
detected. 

(b) The invention allows simultaneous and rapid 
25 processing of a large number of samples. 

(c) The invention is comparatively inexpensive to 
perform, particularly when compared to prior 
methods which achieve a high level of resolution. 

30 

(d) The invention uses techniques that can be 
performed easily without recourse to complex and 
expensive technology . 

35 J3j. Principle underlying the invention 



Fully matched DNA duplexes are more stable than those with 
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base mismatches. Regions of nucleotide sequence 
complementary are double stranded, but mismatched regions 
form single -stranded loops along the length of the DNA 
molecule. The number, size, composition and position of 
5 the single-stranded loops vary for each combination of 
alleles. The rate at which the DNA migrates in 
polyacrylamide or special agarose gels depends on both 
molecular conformation and molecular weight. Heteroduplex 
DNA always migrates more slowly than the corresponding 
10 homoduplex DNA. Both denaturing reagents and/or heat 

enhance the degree of separation of the mismatched DNA 
strands (heteroduplexes) from the corresponding matched 
duplexes (homoduplexes) . 

15 As the molecular conformation of heteroduplexes can be 
manipulated by hybridisation of a known single strand 
reference with unknown complementary single strand (s) , it 
is proposed that heteroduplexes can be separated from each 
other by e.g. denaturing or non- denaturing polyacrylamide 

20 electrophoretic analysis. This allows the separation of 
the two amplified alleles from a particular locus for 
further analysis. In addition, the method of the invention 
(which we call "Complementary Strands Analysis" (CSA) ) , 
permits assessment of the quality of the PCR product before 

25 the process of identification is carried out. CSA is able 
to identify the presence of coamplified non-desirable 
alleles from different loci and, potentially, PCR fragments 
that contain artifacts such as Taq errors and in vitro 
recombinations . 

30 

In addition, CSA itself can be used as a diagnostic 
technique. It can identify alleles by hybridisation of 
allele specific single strand with unknown complementary 
single strand (s) followed by e.g polyacrylamide gel 
3 5 electrophoretic analysis, with or without denaturing 

conditions and/or with or without a temperature gradient . 
The formation of a homoduplex demonstrates identity between 




! 
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at least one of the unknown alleles and the allele specific 
reference, and non- identity between them produces 
heteroduplex (es) . 

5 4_l. Detailed Description of the Invention 

The kinds of DNA molecule that may be separated and 
identified by the methods of the invention include alleles 
of polyallelic genes, segments of genes and non-expressed 
10 fragments. 

Examples of genes with multiple alleles to which the 
invention may be applied are the mammalian MHC genes such 
as the HLA class I and class II genes, the T cell receptor 

15 genes in mammals < 30 ' 3:u , TAP, LMP, ras l29) , non classical HLA 
class I genes, the genes for human complement factors C4 
and C2, Bf in the human HLA complex, and genes located in 
mitochondrial DNA, bacterial chromosomes and viral DNA. 
The invention can be used in the analysis and 

20 identification of mutations (e.g. point mutations) in these 
and other genes and chromosomal aberrations such as 
translocations, deletions and inversions, 

There are three different genes within the HLA class I 
25 group of genes, namely HLA-A, HLA-B and HLA-C, and each of 
these three genes exists in the form of multiple alleles. 
There are a total of about 222 known alleles of the HLA-A, 
HLA-B and HLA-C genes and the sequences of known alleles 
are set out in Arnett and Parham (1995) Tissue Antigens 45 
30 217-257. There are also multiple genes within the HLA 
class II group of genes, known as DR, DQ and DP. 

In the method of the invention, it is necessary to identify 
primer sequences unique for the target gene so as to 
35 include all polymorphic sites of interest in the amplified 
fragment, which should also be manageable in length. For 
example, the polymorphic sites in exons 2 and 3 of HLA 
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class I would facilitate the identification of all 
recognised alleles of HLA-A, B and C, with 5 exceptions, if 
locus-specific amplification is achieved. Therefore, the 
primers used in the invention may, for example, be selected 
5 so as to specifically amplify exons 2 and 3 of each of HLA- 
A, HliA-B and HLA-C separately. Cereb (28) and collaborators 
have described primer sequences located in the first and 
third exons which can be used for locus-specific 
amplification of the entire exon 2 and 3 region of each of 
10 the HLA-A, HLA-B and HLA-C genes. The sequences of 
suitable primers are given in the Example below. 

The reference DNA molecule used in the invention generally 
has a known sequence. The reference may be chosen so as to 

15 have a similar allotype to an allotype that at least one of 
the test alleles is suspected of having. For example, it 
may be known that a test allele is of the HLA-A02 type from 
serological data, but it may not be known which of the 
seventeen A02 sub- types the allele is. In this case, the 

20 reference allele may be chosen to be of sub-type A0201 and 
the method of the present invention could then be used to 
determine which of the A02 sub-types the test allele is. 

The reference strand may be obtained from (a) a homozygous 
25 source, (b) a heterozygous source from which individual 

strands are isolated by gel separation after amplification 
steps or (c) DNA synthesis. There are now about 500 
internationally recognised cell lines which contain HLA 
alleles of known sub-type and these cell lines can be used 
30 as a source of reference alleles. 

The initial broad typing of the test allele may be 
performed using serological techniques. Different alleles 
of HliA induce different types of antibody and the antibody 
3 5 induced by an unknown HLA therefore gives information as to 
the allotype of the HLA. However, serological typing does 
not achieve a high level of resolution and is not 



r 
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sufficiently accurate or reproducible for matching a 
prospective donor with a prospective recipient in a tissue 
or organ transplant operation. 

5 The control DNA used in the method of the invention may be 
a homoduplex between two strands of the same DNA molecule 
(e.g. the reference DNA molecule), so that migration of a 
test duplex to the same position on the electrophoretic gel 
as the control homoduplex indicates that the test duplex is 
10 a homoduplex. If the test duplex is a homoduplex, it can 
be concluded that the unknown DNA molecule is the same as 
the reference molecule. 

Control DNAs may be obtained by simply amplifying a known 
15 DNA molecule using the same primers as used in the method 
of the invention to amplify the reference and unknown 
molecules . 

The mixture of alleles used in the method of the invention 
20 may be from a prospective donor or a prospective recipient 
in a tissue or organ transplant operation. The results of 
the method may therefore be used to match a prospective 
recipient with a prospective donor. 

25 In one embodiment of the invention, the alleles of the 

prospective donor or of the prospective recipient are in 
effect used as reference alleles and duplexes are formed 
between strands of the prospective recipient's alleles and 
of the prospective donor's alleles. Analysis of the 

30 duplexes formed between the strands from the prospective 
recipient and donor reveals whether they have compatible 
alleles. Thus, in one embodiment, the invention provides a 
method for determining whether a prospective recipient in a 
tissue or organ transplant operation has alleles of a gene 

35 that are compatible with the alleles of a prospective donor 
in the operation, which method comprises 
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(i) amplifying the alleles of the prospective 
recipient employing a pair of primers in which 
one of the primers carries a ligand, so as to 
produce amplified double -stranded alleles of the 

5 prospective recipient in which one of the strands 

carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under 

10 conditions such that the ligand binds to the 

receptor; 

(iii) separating the double -stranded alleles into 
single -strands and removing the strands that are 

15 not bound to the support by the ligand; 

(iv) recovering the remaining strands from the 
support ; 

2 0 (v) mixing the recovered strands with complement airy 

strands of the alleles of the prospective donor 
so as to form test duplexes; 

(vi) separating the test duplexes by gel 

25 electrophoresis; and carrying out one or more of 

the following steps : 

(vii) comparing the positions to which the test 
duplexes migrate on the gel with the position of 

3 0 a control DNA; 

(viii) sequencing one or both strands of each of the 
test duplexes; 



35 



(ix) sequence specific primer (SSP) amplification 
analysis; and 
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(x) sequence specific oligonucleotide (SSO) analysis. 

Other proposed uses of the invention include determination 
of the paternity of an individual by identifying one (or 
5 more) of his alleles to see if it is the same as a 

corresponding allele of a potential father. The invention 
may also be used in forensic medicine to determine the 
origin of a sample of body tissue or fluid, as a follow up 
technique in treatment of haematological malignancies or 
10 inherited disorders, in adoptive immunotherapy, and in 
identification of bacteria and viruses. 

In the method of the invention, the amplification steps may 
be carried out by polymerase chain reaction (PCR) . 

15 

The ligand/receptor system used in the invention may, for 
example, be the biotin/streptavidin system or a 
hapten/antibody system. Direct conjugation of the primer 
via a linking group, such as short poly A, to the beads is 

20 an alternative. When the biotin/streptavidin system is 

used, one of the primers used in each of the amplification 
steps may be labelled with biotin, so that when the 
amplification reaction is carried out double -stranded DNA 
is produced in which one strand carries a biotin label. 

25 The double -stranded DNA may then be bound to a solid 
support coated with streptavidin. 

The solid support used in the invention is typically 
magnetic beads. However, other supports may be used, such 

30 as the matrix of an affinity chromatography column. When 
the support is in the form of magnetic beads, the two 
strands of the amplified DNA are separated by attracting 
the beads to a magnet and washing the beads under 
conditions such that the double- stranded DNA dissociates 

35 into single-strands. The dissociation is typically 
performed by incubating the beads three times under 
alkaline conditions (e.g. 0.1 M NaOH) at room temperature 



for about 5 minutes. Usually, the strand which is not 
bound to the support by the ligand is then discarded, 
although it is equally possible to retain the strand that 
is not bound to the support and discard the strand that is 
bound to the support. 

The strand that remains attached to the support may be 
recovered from the support by incubating the support under 
conditions such that the ligand/receptor complex 
dissociates. When the biotin/streptavidin system is used, 
the support is typically heated to e.g. 95 °C for about 5 
minutes; this ensures denaturation of the streptavidin 
molecule to release the biotinylated single strand which is 
then recovered. 

At this stage, there have been provided a single -stranded 
unknown allele and the complementary strand of a reference 
allele. The two strands are then mixed together under 
conditions in which they hybridise to form duplexes. 
Typically, the hybridisation step is performed by heating 
the mixture of strands at about 95 °C for about 3 min, at 
about 70 °C for about 5 min and then at about 65 °C for about 
4 5 min. 

Under these conditions, duplexes are formed which can 
subsequently be separated by gel electrophoresis (e.g. 
polyacrylamide gel electrophoresis) . The electrophoresis 
is preferably carried out under denaturing conditions 
because this has the effect of amplifying the loops formed 
by mismatched strands and hence allows better resolution. 

As an alternative separation technique to gel 
electrophoresis, high pressure liquid chromatography (HPLC) 
may be used. 

In the embodiment of the invention in which one of the pair 
or primers is conjugated to a high molecular weight 
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molecule, the molecule may be a protein such as bovine 
serum albumin (BSA) . The molecular weight of the high 
molecular weight molecule is such that it causes the DNA 
molecule to which it is attached to be sufficiently 
5 retarded in the separation step (e.g. the electrophoresis 
step) to allow the DNA molecule to be separated from a 
duplex without a high molecular weight compound attached. 
For example, the molecular weight of the high molecular 
weight molecule may be from 10 to 200 kDa, preferably 20 to 
10 100 kDa. 

Use of the invention to analyse ras oncogene point 
mutations : - ras has been implicated in the oncogenesis of 
many tumours and appears to be activated by point 
15 mutations. These mutations can occur in all three ras 

genes <N-ras, Harvey-ras and Kirsten-ras) at codons 12/13 
and 61 with corresponding amino acid substitutions in ras 
proteins (p21) . These point mutations can be detected by 
application of the invention. 

20 

Two pairs of primers are needed, one for the 12/13 codons 
and one for codon 61. The primers described by Lyons t29> 
can be used, with modification by covalent attachment of a 
ligand to one primer of each pair for each of the test 

25 fragment and the reference fragment. For example, a ligand 
may be attached to primer Pla (a 12/13 codon primer) and 
primer Plb (a 61 codon primer) for the reference fragment, 
and to primer P2a (a 12/13 codon primer) and primer P2b (a 
61 codon primer) for the test fragment. In this way, 

3 0 complementary ligand- labelled single strands for the 
reference and test fragments are obtained. The 
complementary strands are hybridised and subjected to 
electrophoresis. Detection of a homoduplex between the 
test fragment and a mutant reference fragment will indicate 

35 the presence of the mutation in the test fragment. 

Use of the invention to identify T cell receptor (TCR) 
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rearrangements in T cell tumours and in adoptive 
immunotherapy : - some T cell tumours can be monoclonal in 
origin and a proportion of the T cells from a patient may- 
carry a particular rearrangement of the T cell variable 
5 domain genes alpha/beta or gamma/delta depending on the T 
cell type. The efficacy of a particular treatment or the 
course of the disease can be evaluated by the 
identification of the malignant clone TCR rearrangement. 
The method of the invention with the use of suitable number 
10 of controls can be made semi- quant at ive, which would allow 
the evaluation of the progress of the treatment or the 
disease . 

In adoptive immunotherapy, a specific rearrangement of the 
15 variable domain genes of the TCR can be used as a marker 
for the selected cytotoxic T cell that has been generated 
in- vitro. Post infusion fate of these cells can be 
monitored by a semi -quantat ive detection of the particular 
rearrangement . 

20 

In both methods the T cell variable domain gene primers 
(30 - 31) can be modified by covalent attachment of a ligand at 
the 5' end of the primer pair, and the reference strand . 
will be selected to be complementary to the test DNA 
25 fragments. 

EXAMPLE 

The following Example illustrates the invention. 

30 

1 . Brief Description of the Drawings 

Figure 1 shows a schematic overview of an embodiment of the 
method of the invention. 

35 

Figure 2 shows the band pattern formed between the sense 
strand of LOB41265 (A*0101) with the anti-sense strand (s) 
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of STEINLIN (A*0101) , WIN <A1), DAUDI (A*0102, A*6601) , BM21 
(Al) and E4181324 (Al) under the following conditions: 8% 
polyacrylamide, 2 M urea, 10% formamide, 2 00 Volts during 6 
hrs at room temperature . 

5 

Figure 3 shows duplexes formed between the sense strand of 
JY (A*0201) with the anti-sense strand (s) of M7 (A*0202, 
A*0301) , DK1 (A*0203, A*3301) , RML (A*0204) , WT49 (A*0205) , 
CLA (A*0206, A24), KNE (A*0207, A*0201) , KLO (A*0208, A*0101) 
10 and OZB (A*0209, A*0301) under the following conditions: 8% 
polyacrylamide, 2 M urea, 10% formamide, 200 Volts, 6 hrs, 
at room temperature. 

Figure 4 shows the position of a heteroduplex band compared 
15 to a homoduplex control. The heteroduplex band corresponds 
to the hybridisation between the sense strand of JY 
(A*0201) and the anti-sense strand of RML (A*0204) which 
differ from each other ny only one nucleotide. Denaturing 
conditions were: 8% polyacrylamide, 3 M urea, 15% 
20 formamide, 200 Volts, 6 hrs. at 50°C. 

2 . Methods 

2 . 1 Summary 

25 

Amplification and isolation of the biotinylated anti-sense 
strand (s) were performed. They were then hybridised with 
allele specific reference sense single strands. 

3 0 In order to test the resolving power of this method, 7 
allele specific reference sense single strands were 
prepared. These were hybridised with several isolated 
anti-sense strand HLA-A alleles which were selected to 
include alleles with one or several nucleotide differences 

3 5 compared to the reference strand. 



Following this step, PAGE analysis was performed under 
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several different denaturing conditions. Urea at a range of 
2-6 M and formamide between 10%-30% concentrations were 
used and the gels were run at either room temperature or 
50-58°C (200 Volts for 6 hours) . 

5 

The identification of a homoduplex in the denaturing PAGE 
gels indicated identity between the allele specific single 
reference strand and at least one of the alleles from the 
unknown DNA sample . 

10 

As a positive control, double strand DNA PCR product from 
the allele reference was always used. 

2 . 2 Locus specific amplification of HLA class I genes 

15 

For typing purposes amplification of exons 2 and 3 is 
desirable, and the primers were therefore selected to 
amplify the stretch of the genome between intron 1 and 
20 intron 3 . The localisation and nucleotide sequences of the 
HLA locus-specific primers used are given in the reagents 
section . 

PCR reactions were performed in a total volume of 100/xl 
25 using 1/zg of genomic DNA and 25 pmoles of each locus- 
specific primer. The 3 ' -primer was biotinylated at 5' -end. 
This arrangement ensures the incorporation of the 
biotinylated primer onto the amplified antisense DNA 
strand. PCR conditions are given in the following table. 

30 

Thermocvcling conditions 
A and C loci 95° C 4 min. 1 cycle 

35 

95°C 30 sec. 

70°C 50 sec. 33 cycles 



72°C 
72°C 



3 0 sec. 



8 min. 



1 cycle 



B locus 



95°C 



4 min. 



1 cycle 



95°C 



3 0 sec. 



65°C 



50 sec 



33 cycles 



72°C 



3 0 sec 



72°C 



8 min. 



1 cycle 



2 . 3 Separation of the amplified DNA strands 
2.3a Removal of non-biotinvlated strand: 

Magnetic beads with covalently coupled streptavidin on the 
surface were added to the PCR product and incubated for 3 0 
minutes at 43 °C. In this way the amplified PCR product was 
immobilised by the interaction of biotin and streptavidin. 
After incubation, the tubes were placed against a magnet 
and the beads were washed with washing buffer to remove the 
remaining PCR reaction components. 

The non-biotinylated DNA strand was then dissociated from 
the beads by incubation with 0.1 M NaOH at room temperature 
(r.t.) for 5 minutes (X3). Following this the beads were 
washed to remove excess NaOH and re suspended in 50 /il of 
hybridisation buffer. 

2.3b Removal of biotinvlated DNA strand: 

The bead suspension was heated at 95°C for 5 minutes; this 
ensures denaturation of the streptavidin molecule to 
release the biotinylated amplified anti- sense single strand 
which was then removed and placed in a clean tube. 
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At this stage, the isolates contained single biotinylated 
DNA strands from each allele. 

2 . 4 Preparation of allele specific reference sincxle- 
5 stranded DNA 

DNA was extracted from 10th IHW cell lines. The following 
homozygous cell lines were selected as allele specific 
reference DNA: JY (A*0201) , RML (A*0204) , WT49 (A*0205) , 
10 T7527 (A*0206), KRC-005 (A*0212) , AMALA (A*0217) and 
L0541265 (A*0101) . 

The PCR conditions for amplification were as above, with 
the exception that in each case the locus -specific 5'- 
15 primer was biotinylated (5' -end). The PCR products were 
analysed by PAGE to assess the fidelity of the 
amplification and in all cases a single band was obtained. 

2 . 5 Hybridisation 

20 

The biotinylated anti-sense strand (s) from above were mixed 
with the sense strands, and the mixture was heated at 95 °C 
for 3 min., incubated at 70°C for 5 min., and then at 65°C 
for 45 min. Under these conditions, the sense and anti- 
25 sense strands were hybridised. The heteroduplexes formed 
could subsequently be separated from each other by 
electrophoresis in polyacrylamide gel. 

3 . Reagents ; 

30 

A) Nucleotide sequences of primers used for locus -specific 
amplification: 

5' A locus primer: GAA ACG/C GCC TCT GT/CG GGG AGA AGC AA 
35 (Intron 1: 21-46) 



3' A locus primer: TGT TGG TCC CAA TTG TCT CCC CTC 



(Intron 3: 66-89) 

5' B locus primer: GGG AGG AGC GAG GGG ACC G/CCA G 
(Intron 1: 36-57) 

3' B locus primer: GGA GGC CAT CCC CGG CGA CCT AT 
(Intron 3 : 37-59) 

5' C locus primer: AGC GAG GG/TG CCC GCC CGG CGA 
(Intron 1: 42- 61) 

3' C locus primer: GGA GAT GGG GAA GGC TCC CCA CT 
(Intron 3: 12-35) 



B) Buffers: 

Washing buffer: 

Hybridisation buffer: 
PCR buffer: 

TE buffer 



10 mM Tris-HCl pH 7.5 

1.0 mM EDTA 

2.0 M NaCl 

20 mM Tris-HCl pH 8 . 4 

50 mM KC1 

20 mM Tris-HCl pH 8 . 4 

50 mM KC1 

0.2 mM MgC12 

10 mM Tris-HCl pH 7 . 5 

1 mM EDTA 



C) Various 

Dynabeads M-280 Streptavidin (10 mg/ml) 
Magnetic particle concentrator -Dynal MPC 
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A Thermal cycler (PTC- 2 00 Peltier Thermal Cycler MJ Research) 

Ultrapure dNTP set, 2' -Deoxynucleoside 5' -Triphosphate 
( Pharmacia Biotech) 

5 

Taq DNA Polymerase (Gibco BRL) 
50 mM MgCl2 
10 0.1 M NaOH 

SeaPlaque Agarose (Flowgen Instruments Ltd) 

Protogel, 30% Acrylamide and 0.8% Bisacrylamide (National 
15 Diagnostics) 

4 . Results: Identification of alleles by CSA 

Seven allele specific reference sense strands were isolated 

2 0 from homozygous cell lines and hybridised to several anti- 

sense strands from other cell lines whose HLA specificity was 
defined by sequencing and in some cases only by serology. 

Alleles could be identified by homoduplex formation with a 
25 reference strand, which would migrate at the same rate as the 
double stranded DNA reference band (control band) in 
denaturing polyacrylamide gel. Non-identity would lead to 
heteroduplex formation and cathodic band(s) compared with the 
control band. 

30 

Under the denaturing conditions of 2 M Urea and 10% formamide 
at room temperature, it was observed that when the reference 
strand matched completely the anti-sense strand a single 
homoduplex band was visible in the gel. In cases when the 

3 5 reference strand differed by 3 or more bases from the anti- 

sense strand, a cathodic band corresponding to a heteroduplex 
was seen. This pattern was observed reproducibly for all 
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the allele specific reference strands and is therefore 
independent of the position of the mismatch (es) on the 
strands and the specific base sequence of the allelic 
reference (Figures 2 and 3) . 

Under these conditions, the heteroduplex band containing 1 or 
2 base mismatches was indistinguishable from the homoduplex 
band. By altering the denaturing conditions to 3 M urea and 
15% formamide at 50 °C, it was possible to distinguish between 
heteroduplexes with one or two mismatches from the homoduplex 
control band (Figure 4) . 
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CLAIMS 



1 . A method for separating a DNA molecule from a mixture 
DNA molecules having the same number of nucleotides but 

fferent base sequences, which method comprises: 

(i) amplifying the DNA molecules in the mixture ; 

(ii) hybridising single strands of the amplified DNA 
molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; and 

(iii) separating the duplexes. 

2 . A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce an 
amplified mixture of double -stranded DNA molecules 
in which one of the strands carries a ligand; 

(ii) contacting the amplified mixture of double -stranded 
DNA molecules with a receptor on a solid support 
under conditions such that , the ligand binds to the 
receptor; 

(iii) separating the mixture of double -stranded DNA 
molecules into single-strands and removing the 
strands that are not bound to the support by the 
ligand; 

(iv) recovering the remaining strands from the support; 

(v) mixing the recovered strands with a complementary 
strand of a reference DNA molecule so as to form 
dup 1 exe s ; and 



(vi) separating the duplexes. 



3. A method according to claim 2 wherein the 
complementary strand of the reference DNA molecule is 
provided by 

(i) 'amplifying the reference DNA molecule employing a 
pair of primers in which one of the primers carries 
a ligand, so as to produce amplified double -stranded 
re f ere nce DNA molecule in which one of the strands 
carries a ligand; 

(ii) contacting the double -stranded reference DNA 
molecule with a receptor on a solid support under 
conditions such that the ligand binds to the 
receptor; 

(iii) separating the double -stranded reference DNA molecule 
into single-strands and removing the strand that is 
not bound to the support by the ligand; and 

(iv) recovering the remaining strand from the support. 
4 . A method according to claim 1 which comprises 

(i) amplifying the DNA molecules in the mixture 
employing a pair of primers in which one of the 
primers carries a high molecular weight molecule, so 
as to produce an amplified mixture of double- 
stranded DNA molecules in which one of the strands 
carries a high molecular weight molecule; 

(ii) separating the mixture of double -stranded DNA 
molecules into single strands; 

(iii) mixing the single strands with a complementary strand 
of a reference DNA molecule so as to form duplexes; 
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and 

(iv) separating the duplexes. 

5 5 . A method according to claim 4 wherein the 

complementary strand of the reference DNA molecule is 
provided by 

(i) amplifying the reference DNA molecule employing a 
10 pair of primers in which one of the primers carries 

a high molecular weight molecule, so as to produce 
an amplified double -stranded reference DNA molecule 
in which one of . the strands carries a high molecular 
weight molecule; and 

15 

(ii) separating the double -stranded reference DNA 
molecule into single strands. 

6 . A method according to claim 1 which comprises 

20 

(i) amplifying a single strand of each of the DNA 
molecules in the mixture; 

(ii) mixing the amplified single strands with a 
25 complementary strand of a reference DNA molecule so 

as to form duplexes; and 

(iii) separating the duplexes. 

3 0 7. A method according to claim 6 wherein the 

complementary strand of the reference DNA molecule is 

provided by amplifying a single strand of the reference DNA 
molecule. 



35 



8 . A method according to any one of the preceding claims 
wherein the reference DNA molecule has a known sequence. 
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9 . A method according to any one of the preceding claims 
wherein the duplexes are separated by gel electrophoresis. 

10. A method according to claim 9 wherein the 
5 electrophoresis is performed under denaturing conditions. 

11. a method for identifying a DNA molecule in a 
mixture of DNA molecules having the same number of 
nucleotides but different base sequences, which method 

10 comprises separating the DNA molecules by a method as defined 
in claim 9 or 10, and comparing the positions of the 
separated duplexes on the gel with the position of a control 
DNA molecule . 

15 12. A method for identifying a DNA molecule in a mixture 

of DNA molecules having the same number of nucleotides but 
different base sequences, which method comprises separating 
the DNA molecules by a method as defined in any one of claims 
1 to 10 and sequencing each of the separated DNA molecules, 

20 carrying out sequence specific primer (SSP) amplification 
analysis or carrying out sequence specific oligonucleotide 
(SSO) analysis. 

13 . A method according to any one of the preceding claims 
25 wherein the mixture of DNA molecules is a mixture of alleles 
of a polyallelic gene. 

14. A method according to claim 13 wherein a reference 
allele is used which has the same serotype as at least one of 

30 the alleles in the mixture of alleles. 

15. A method according to claim 13 or 14 wherein the 
mixture of alleles is from a prospective recipient or a 
prospective donor in a tissue or organ transplant operation. 

35 

16. A method for determining whether a prospective 
recipient in a tissue or organ transplant operation has 
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alleles of a gene that are compatible with the alleles of a 
prospective donor in the operation, which method comprises 

(i) amplifying the alleles of the prospective recipient 
5 employing a pair of primers in which one of the 

primers carries a ligand, so as to produce amplified 
double -stranded alleles of the prospective recipient 
in which one of the strands carries a ligand; 

10 (ii) contacting the amplified double -stranded alleles 

with a receptor on a solid support under conditions 
such that the ligand binds to the receptor; 

(iii) separating the double -stranded alleles into single- 
15 strands and removing the strands that are not bound 

to the support by the ligand; 

(iv) recovering the remaining strands from the support; 

2 0 (v) mixing the recovered strands with complementary 

strands of the alleles of the prospective donor so 
as to form test duplexes; 

(vi) separating the test duplexes by gel electrophoresis; 
25 and carrying out one or more of the following steps: 

(vii) comparing the positions to which the test duplexes 
migrate on the gel with the position of a control 
DNA molecule; 



30 



35 



(viii) sequencing the test duplexes; 

(ix) sequence specific primer (SSP) amplification 
analysis ; and 

(x) sequence specific oligonucleotide (SSO) analysis. 



- 38 - 

17. A method according to claim 16 wherein the 
complementary strands of the alleles of the prospective donor 
are provided by 

(i) amplifying the alleles of the prospective donor 
employing a pair of primers in which one of the 
primers carries a ligand, so as to produce amplified 
double -stranded alleles of the prospective donor in 
which one of the strands carries a ligand; 

(ii) contacting the amplified double -stranded alleles 
with a receptor on a solid support under conditions 
such that the ligand binds to the receptor; 

(iii) separating the double -stranded alleles into single- 
strands and removing the strands that are not bound 
to the support by the ligand; and 

(iv) recovering the remaining strands from the support. 

18. A method according to claim 16 or 17 wherein the 
prospective donor is selected to have alleles of the same 
serotype as the prospective recipient . 

19. A method according to any one of claims 16 to 18 
wherein the control DNA is a homoduplex between two strands 
of the same allele and migration of the test duplexes to the 
same position on the gel as the homoduplex indicates that the 
prospective recipient and the prospective donor have the same 
alleles . 

20. A method according to any one of claims 13 to 19 
wherein the alleles are of a human leucocyte antigen (HLA) 
class I gene or an HLA class II gene. 

21. A method according to claim 20 wherein the alleles 
are of the HLA-A gene, the HLA-B gene or the HLA-C gene. 
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22. A method according to any one of claims 2, 3 and 9 
to 21 wherein the ligand is biotin and the receptor is 
streptavidin. 

5 23 . A method according to any one of claims 2, 3 and 9 

to 22 wherein the solid support is magnetic beads, and the 
strands which do not carry a ligand are removed by attracting 
the beads to a magnet and washing the beads under conditions 
such that the double- stranded DNA molecules dissociate into 
10 single strands. 

24 . A method according to any one of the preceding claims 
wherein amplification of the DNA molecules is performed by 
polymerase chain reaction (PCR) . 

15 

25. A method according to any one of claims 15 to 24 
wherein the prospective recipient and the prospective donor 
are a prospective bone marrow recipient and a prospective 
bone marrow donor, or a prospective kidney recipient and a 
2 0 prospective kidney donor. 





- 40 - 



ABSTRACT 



METHODS FOR SEPARATING AND IDENTIFYING DNA MOLECULES 

The invention provides a method for separating a DNA 
5 molecule from a mixture of DNA molecules having the same 
number of nucleotides but different base sequences, which 
method comprises: 

(i) amplifying the DNA molecules in the mixture; 



(ii) hybridising single strands of the amplified DNA 
molecules with a complementary strand of a 
reference DNA molecule so as to form duplexes; and 



The different DNA molecules in the original mixture give 
rise to duplexes having different numbers, positions or 
types of mismatches . This allows the duplexes to be 
20 separated by, for example, gel electrophoresis. The 

separated duplexes can then be analysed to identify the DNA 
molecules that were present in the original mixture. 



10 



15 



(iii) separating the duplexes. 
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